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Abstract. For any Z 2 nearest neighbor shift of finite type X and any integer 
n > 1, one can define the horizontal strip shift H n (X) to be the set of configu- 
rations on Z X {1, . . . , n} which do not contain any forbidden transitions for X . 
Each H n (X) can be considered as a Z nearest neighbor shift of finite type, and 
it is always the case that limn^oo j^iggg)) = h top {X). In this paper, we 
combine ergodic theoretic techniques with methods from percolation theory 
and interacting particle systems to show that for the Z 2 hard square shift H, 
it is in fact the case that lining h top (H n+1 {H)) - h top {H n {H)) = h to P(H), 
and that the rate of convergence is at least exponential. A consequence of this 
is that h top (H) is computable to any tolerance — in time polynomial in n. We 
also give an example of a Z 2 block gluing nearest neighbor shift of finite type 
Y for which h top (H n j l .i (Y)) — h top (H n (Y)) does not even approach a limit. 



1. Introduction 

Some of the most studied objects in the field of symbolic dynamics are shifts 
of finite type (or SFTs.) A Z d SFT is defined by specifying a finite set A, called 
the alphabet, and a set of forbidden configurations. For any such specification, 
the associated Z d SFT is the set of configurations in A 1, in which no forbidden 
configuration appears. In this paper, we will mostly concern ourselves with nearest 
neighbor SFTs, which are SFTs for which all forbidden configurations are just pairs 
of adjacent letters. 

To any Z d SFT X, one can assign a real number h top (X), called its topological 
entropy. Informally, h top {X) measures the exponential growth rate of the number of 
configurations which appear in points of X . (We postpone a formal definition until 
Section[2l) Topological entropy is quite easy to compute for Z SFTs; to any Z SFT 
X, one can associate a 0-1 matrix called its transition matrix, and h top (X) is just 
the logarithm of the Perron eigenvalue of this matrix. For a general introduction 
to one-dimensional symbolic dynamics and topological entropy, see [LinMj . 

In general, it is much harder to compute h top (X) for Z 2 SFTs. In fact, there are 
very few nondegenerate examples of Z 2 SFTs for which the topological entropy has a 
known closed form. ( |Ba2j . |FiT] . |Kas] . [Liebj ) However, one can approximate such 
a topological entropy by using the easier to compute one-dimensional topological 
entropies. For any Z 2 nearest neighbor SFT X with alphabet A, one can define 
H n {X) to be the set of configurations on Z x {1, ... ,71} which contain no forbidden 
pair of adjacent letters. Then H n (X) can be considered as a Z nearest neighbor 
SFT with alphabet the set of legal n-high columns in X, which we call A n (X). 
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a n b n 

Two letters : and : in A n (X) may appear consecutively in H n (X) if and only 
»i 61 

if : is legal in X. We can then define h n (X) := h top (H n (X)), the topological 

ai&i 

entropy of H n (X) as a Z SFT. One can approximate h top (X) via h n (X); it turns 
out to be true that — > h top (X) for any X. (This is Lemma [T] from Section [3] 
and we postpone the proof until then.) 

One well-studied example of a Z 2 nearest neighbor SFT is the Z 2 hard square 
shift TL, which is the Z 2 nearest neighbor SFT with alphabet A = {0, 1} where the 
only forbidden pairs of letters are two adjacent Is horizontally or vertically. Since 
this is the main SFT we study in this paper, we denote h to,p (H) by h, h n (W) by 
hn, H n (H) by H n , and A n (H) by A n . 

There is no known closed form for the topological entropy h of the hard square 
model, which is also known as the hard square entropy constant. However, there 
is quite a bit of literature regarding bounds and approximations to h. (see |Baj . 
[CalWj . [E] . |Fo J] ) There is, for instance, an algorithm ([Pi]) that lets a computer 
generate the transition matrix for H n for any n. One can then use these matrices 
to compute the sequence h„, and use the fact that ^ — > h to approximate h. 
Interestingly, empirical data ([E], [Pi]) indicates that the differences h n+ i — h n 
converge much more quickly to h; ^ seems to converge at a linear rate, whereas 
h n+ i — h n seems to converge exponentially fast. To our knowledge however, even a 
proof of the convergence of h n+ \ — h n has been an open problem. Our main result 
shows that this convergence does in fact occur with exponential rate. 

Theorem 1. lim n ^oo — h n = h, and the rate of this convergence is at least 
exponential. 

Interestingly, to prove this entirely combinatorial or topological result, we will 
be using an almost entirely probabilistic or measure-theoretic proof. We use several 
techniques from the worlds of probability and interacting particle systems, whose 
definitions and exposition are contained in Section |3] Our proof relies heavily on 
some results and techniques from [vdBS . 

These techniques are quite powerful and have been used to prove results from 
symbolic dynamics and ergodic theory before; see |BuS] . [vdBSj . [Ha] , and [Ha2j . 
It is our hope that the applications of interacting particle system methods used in 
this paper will inspire more work on the fascinating interplay between statistical 
mechanics and symbolic dynamics. 

2. Definitions 

We here lay out the necessary definitions and terminology for the rest of the 
paper. An alphabet A will always be a finite set with at least two elements. 

Definition 1. The Z d full shift on the alphabet A is the set A zd . For any full shift 
A 1 ' , we define the Z d -shift action {o~ v } v(£ %d on A z as follows: for any v G Z d 
and x E A 1 ' , (a v (x))(u) = x(y + u) for all u E 7L d . 

Definition 2. A Z d subshift on an alphabet A is a set X C A 1 ' with the following 
two properties: 
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(i) X is shift-invariant, meaning that for any x € X and v £ Z d , a v (x) £ X . 

(ii) X is closed in the product topology on A z . 

When the value of d is clear, we will sometimes omit the Z d and just use the 
term subshift. 

A configuration u on the alphabet A is any mapping from a non-empty subset 
S of Z d to A, where S is called the shape of u. For any configuration u with shape 
S and any T C S, denote by u\t the restriction of u to T, i.e. the sub-configuration 
of u occupying T. 

For any integers a < b, we use [a, b] to denote {a, a + 1, . . . , b}. 

Definition 3. A Z d shift of finite type (or SFT) X is defined by specifying a 
finite collection J- of finite configurations on A, and then defining X — (A z )jr to 
be the set of x £ A z such that x\s £ J 7 for all finite S C Z d . For any fixed X , 
the type of X is the minimum positive integer t such that for some T consisting 
entirely of configurations with shape [l,t] d , X = (A 1, )jr. 

It is not hard to check that any SFT is a subshift. 

Sites u, v £ Z d are said to be adjacent if Y^,i=i \ u i — = 1. If a Z d SFT X has 
forbidden list T consisting entirely of pairs of adjacent letters, then X is called a 
Z d nearest neighbor SFT. In this paper, we will mostly concern ourselves with 
d = 1 or d — 2, and all SFTs we consider will be nearest neighbor SFTs. 

Definition 4. The Z d hard square shift is the nearest neighbor SFT on the 
alphabet {0, 1} whose forbidden list J- consists of all pairs of adjacent Is in any of 
the d cardinal directions. 

Definition 5. In a nearest neighbor SFT X with alphabet A, a £ A is a safe 

symbol if none of the forbidden configurations in J- contain A. In other words, a 
is a safe symbol if it may legally appear next to any letter of the alphabet in any 
direction. 

For example, is a safe symbol for the Z d hard square shift. 

Definition 6. For any Z d SFTX with forbidden list T and any finite configuration 
w with shape S, w is locally admissible in X if w\t ^ F for all T C S, and w is 
globally admissible in X if there exists x £ X for which x\s = w. 

The difference between local and global admissibility is subtle but quite pro- 
nounced. It is always quite easy to check whether a configuration is locally admis- 
sible, and for Z SFTs also to check global admissibility. However, for Z 2 SFTs, 
the question of whether or not a configuration is globally admissible is undecidable. 
In other words, there does not exist an algorithm which takes as input the set of 
forbidden configurations J- and a locally admissible configuration w, and gives as 
output an answer to the question of whether w is globally admissible. QBej . [Wan] ) 

In this paper, we will mostly be concerning ourselves with the Z 2 hard square 
shift, which we denote by H. All locally admissible configurations in T-L are globally 
admissible, since a locally admissible configuration in % can always be completed 
to a point of H by filling the rest of Z 2 with 0s. For this reason, we will just refer 
to any locally admissible or globally admissible configuration in T-L as admissible. 

Definition 7. The language of a subshift X , denoted by L(X), is the set of globally 
admissible configurations in X . The set of configurations with a particular shape S 
which are in the language of X is denoted by Ls{X). 
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Definition 8. The local language of any If 1 SFT X with forbidden list J 7 , denoted 
by LA(X), is the set of all locally admissible configurations in X. The set of 
configurations with shape S which are in the local language of X is denoted by 
LA S {X). 

For any configuration u with shape S in L(X), denote by [u] the set {x G X : 
x\s — u}, called the cylinder set of u. 

Definition 9. The topological entropy of a T, d subshift X , denoted by h top (X), 
is defined by 



Jl,j2,.",3d-K» 3lj2---Jd 

To see why the limit exists, note that the function f(ji,.-.jd) '■= hil^rr? [i,j 4 ]POI 
is subadditive in each coordinate, i.e. for every i G [1, d] and a, b > 0, 

f{ji, ■ • -,ji-i,a + b,j i+ i, . . . ,jd) < f(ji, ■ ■ ■ ,ji-i,a,j i+1 , . . .,j d ) 

+ /(iii • • • , 3i-i,b, ji+i, . ■ . , jd)- 

The classical Fekete's subadditivity lemma implies that for any subadditive func- 
tion f(n) of one variable, lim„_ s . 00 exists. A multivariate version, which can 
be found in [Cap], shows that for any function f{j\, . . . ,jd) which is subadditive in 
each variable, 

lim Rn Ml) 

ji,...,jA-Hx> 3x3i... jd 

exists (and is invariant of how each ji — > oo), and that the limit is equal to the 
infimum. For Z d SFTs, topological entropy may also be computed by using the 
local language, i.e. if Lj-jd ^ ? .. (X) is replaced by LA^d , t (X) in the definition 
of topological entropy, the limit is unchanged. ([Fr], |HocMj ) 

We will also need some definitions specific to the arguments used in this paper. 

We will frequently consider Z d as a graph, where two sites are connected by an 
edge if they are adjacent. For any set S C Z d , we identify S with the maximal 
subgraph of Z d with vertex set S, i.e. the graph with vertex set S and edges 
between all pairs of adjacent vertices in S. 

For any subset G of Z d , and any set S C G, the boundary of 5* within G, which 
is denoted by <9(5, G), is the set of p e G \ S which are adjacent to some q € S. If 
we refer to simply the boundary of a set S, or write dS, then G is assumed to be 
all of 7L d . 

For any integer i, we define Ri = Z x {i}, the row at height i. 

For any partition £ of a set S, and for any s 6 5, we use £(s) to denote the 
element of £ in which s lies. If £ is a partition of the alphabet A of a Z d subshift 
X, then <f)£ is the factor map from X to £ z defined by (4>^x)(v) = £(x(v)) for all 
u G Z d . 

3. Some preliminaries 
We begin by justifying a claim from the introduction. 
Lemma 1. For any X, lim^^ = h to P(X). 
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Proof. By the comments following the definition of topological entropy, h top (X) — 



inf m ,™ and therefore |L [1)m]x[1 , n] (X)| > e ht ° P ^ mn for all m,n G 

N. By the definition of h n (X) = lim m -,oo ' d|l "^"'"1 wi , Therefore, 

^p- > h top (X) for all n. Fix any e > 0. By definition of h to P(X), there exists 
iV so that for any m,n > N, \L [1:Trl]x[1>n] (X)\ < e (h top (x)+,)rnn _ Thig meang that 



' < h top (X) + e for n > N. Since e was arbitrary, we are done. 

■ 

We use several measure-theoretic or probabilistic tools in the proof of Theorem[TJ 
chiefly the concepts of percolation, measure-theoretic entropy, stochastic domina- 
tion, Gibbs measures, and the d metric. We define these notions and state some 
fundamental theorems relating them in this section. All measures on subshifts con- 
sidered in this paper are Borel probability measures for the product topology on 



We begin by giving a few notations and facts about independent site percolation 
which will be necessary for our proof. For a detailed introduction to percolation 
theory, see |Grj . 

Definition 10. For any < p < 1 and any infinite connected graph G — (V(G), E(G)), 
the independent site percolation measure on G, denoted by P P: g, is the mea- 
sure on {0, which independently assigns a 1 with probability p and with 
probability 1 — p at every site in V{G). 

Often a site with a 1 is said to be open and a site with a is said to be closed. 
We define the event A where there exists an infinite connected cluster of Is in G, 
and say that A is the event where percolation occurs. One of the foundational 
principles of percolation theory is that for any countable locally finite graph, there 
exists a probability p c (G), called the critical probability for site percolation 
on G, such that for any p < p c {G), P p y(G){A) = 0, and for any p > p c (G), 
Pp,v(G)(A) > 0. We most often take G to be the graph representation of I? as 
described earlier, which is often called the square lattice in the literature. For 
this reason, the notation P p with no graph G will always be understood to represent 
P Vi t?i and p c will represent p c {l?). It was shown in |Hij that p c > 0.5, and there 
have been successive improving lower bounds onp c since then. ( [ MeFj . [T] . [ydBE , 



In this paper, we will be concerned only with the case p < p c , where percolation 
occurs with probability 0. If G is the square lattice, then this of course implies that 
P p (0 o d([— n, n] 2 )) — > as n — > oo, where for any S C Z 2 , 5 represents the 
event where there is a connected path of Is starting at and ending at a point in 
S. In fact, an even stronger statement can be made. The following is a classical 
theorem from percolation theory, proved by Menshikov. ( |Mej ) 

Theorem 2. On the square lattice, for any p < p c , there exist A and B so that 
P p (0 <-> <9([-n,n] 2 )) < Ae- Bn for all n. 

We now turn to measure-theoretic entropy and conditional measure-theoretic 
entropy, beginning with finite partitions. For any finite measurable partitions £ 
and n of a measure space (X, a), we make the definitions 




.Z) 
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where terms with fJ,(A) = are omitted from the first sum and terms with [i(A n 
C) = are omitted from the second sum. 

The following decomposition result can be found in any standard book on ergodic 
theory (such as | Wal) ) : 

Theorem 3. For any measure space (X,fj,), any measurable partition rj of X , and 
any partition £ of X which is a refinement of rj, = Hp(rj) + Hp(£ \ rj). 

For any measure /i on a Z d subshift which is stationary, i.e. n{B) = n(o~ v B) for 
all v € Z d and measurable B, we may define its entropy. 

Definition 11. For any finite alphabet A and stationary measure fi on A z , the 
measure-theoretic entropy of fj, is 



h{fi) = lim — - — -Hp V/ °vV 

\veUtilhji] 



■r-h j- ►« J1J2 ■■■3d 



where V is the partition of X into cylinder sets determined by the letter at x(0). 
(i.e. each element ofV is [a] for some a G A.) 

Again, this limit exists (independently of how each j\ — > oo) and is equal to 
its infimum by the coordinatewise subadditivity of the function g(ji, . . . , jd) '■= 

Hp (\/ V £Yi d ] a vP^ and the already mentioned multivariate generalization of 
Fekete's subadditivity lemma found in |Cap| . 

Alternately, we can write measure-theoretic entropy more concretely: 

Mm)=. . lim . . 1 . Y] M(M)inM(M)) 

31 ,32,—,U->-°° Jlj2 ■ • ■ 3d ~T 

where terms with n([w]) =0 are omitted. 

We will also deal with measure-theoretic conditional entropy. 

Definition 12. For any finite alphabet A, any stationary measure fi on A 1, , and 
any measurable partition £ of A z , the measure-theoretic conditional entropy 

of [i with respect to £ is 



h(fi | = . , lim . . 1 . Hp W a v V | W a v £ 

•»i »o n j — ^rvi 1i To • • • 1 j I v " 



3i,h,—,3d-Hx> ]1]2 ■ ■ ■ 3d 



where again V represents the partition of X into cylinder sets determined by the 
letter at x(0). 

Note that when £ is the partition {0,X}, (i.e. £ "contains no information") 
h(a | = h(u). 

Again there is a more concrete representation for conditional measure-theoretic 
entropy. We will only deal with the case where £ is a coarser partition than V, in 
which case £ corresponds to some partition of A in an obvious way and we will say 
£ was induced by this partition of A. For such £, 

/<(/< . Hm — — J2 >'■ "■ - A » ( " { ir - ] 



hh "' U W JUM K " ((V, || : :„ -0 N) 

where terms with /i ( ( V u en d il 1 id t^l ) = ^ are om itted. 
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We note that for any measure (iona full shift A 1, and any partition £ of A 1, 
induced by a partition of A, the push- forward </>{(//) of \x under the factor map 4>£, 

(i.e. the measure <^0) defined by (^(u))(C) = (i U^c) for all C C <f d for 

which <\>^ X C is measurable) is a measure on £ z . The following proposition follows 
immediately from Theorem [3] and the definitions of /i(/x | £), and <f>^: 

Proposition 1. For any finite alphabet A, any stationary measure /i on A 1 , and 
any partition £ of A, 

%) = /i( M | o + HMri)- 

Measure-theoretic entropy and topological entropy are related by the following 
Variational Principle. (See |Mij for a proof.) 

Theorem 4. For any Z d subshift X, h top (X) = swph(/j,), where /i ranges over 
measures whose support is contained in X. This supremum is achieved for some 
such u. 

Definition 13. A stationary measure [i supported on a subshift X is called a mea- 
sure of maximal entropy if h(u) = h top (X). 

Measures of maximal entropy will be useful in the proof of Theorem [TJ since we 
can rewrite the topological entropies in the theorem as measure-theoretical entropies 
with respect to measures of maximal entropy. Measures of maximal entropy on 
nearest neighbor SFTs also have another extremely useful property. 

Definition 14. For any finite alphabet A and countable locally finite graph G = 
(V(G), E(G)), a measure fj, on A v ^ is called a G-Markov random field (or G- 
MRF) if for any finite S C V(G), any n € A s , any finite T C {V{G) \ S) s.t. 
d(S, G) C T, and any S £ A T with fi([S]) ^ 0, 

n(x\ s = V I x\ d ( S ,G) = $\d(S,G)) = M(^ls = V I x \t = S). 

Informally, fj, is an MRF if, for any finite S C V(G), the sites in S and the sites 
in V(G) \ (S U d(S, G)) are //-conditionally independent given the sites on d(S, G). 
We note that our definition of MRF differs slightly from the usual one, where the 
right-hand side would involve conditioning almost surely on an entire configuration 
on V(G) \ S rather than arbitrarily large finite subconfigurations of it. However, 
the definitions are equivalent and the finite approach leads to simpler calculations 
and proofs. 

Proposition 2. ( |BuS2j . p. 281, Proposition 1.20) For any Z d nearest neighbor 
SFT X , all measures of maximal entropy for X are Z d -MRFs, and for any such 
measure fj, and any finite shape S C the conditional distribution of ft on S given 
any 5 £ L d ^ s ^ Z d^(X) is uniform over all configurations x € Ls(X) such that the 
configuration y defined by y\s — x and y\d(S,z d ) — & * s locally admissible in X. 

In fact we will only use Proposition [2] for d = 1, where it is a much more 
classical fact (|Paj). but we state it in full generality here because the conclusion of 
Proposition [2] is related to the well-studied Gibbs measures from statistical physics. 
In |vdBS] . they study a more general class of measures; in their language, a measure 
on a T, d hard square shift satisfying the conclusion of Proposition[2]is called a hard- 
core measure with all activities ai equal to 1. 
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Definition 15. For any connected subgraph G of the square lattice, /i is a uni- 
form hard-core Gibbs measure on G if it is a G-MRF such that for any finite 
connected set B C G and any admissible 5 G Ld(B,G){H), ^{x\b = ot \ x\gm g) = ^) 
is uniform over all a G A B which are admissible given 8, i.e. the configuration y 
defined by y\s = ct and y\d(B.G) = S is in L{%). 

Theorem 5. For every infinite connected subgraph G of the square lattice, there is 
a unique uniform hard-core Gibbs measure on S . 

Proof. Theorem 2.3 in (vdBSj implies (in the case where all a; = 1 in their notation) 
that for any such G, there is a unique uniform hard-core Gibbs measure on G if 
percolation occurs with probability with respect to Po.5,G- We recall that p c > 0.5, 
and since G C Z 2 , clearly p c (G) > p c > 0.5, and by definition of p c (G) we are done. 

■ 

In fact, we will eventually be able to represent uniform hard-core Gibbs measures 
on infinite subgraphs G of Z 2 as weak limits of uniform hard-core Gibbs measures 
on finite S, but for this we will need the notion of stochastic dominance. We first 
need to define the notion of a coupling of a finite set of measures. 

Definition 16. For any n and any probability spaces (JQ, i 6 [1, n], a coupling 

of ui, (i2, ■ ■ ■ j fJ"n * s a measure A on ]X =1 Xi such that for any j G [1, n] and any 
fj, j -measurable B C Xj, 

A-l n \ 

A \\X X xBx 11 X k \ =ii 3 {B). 

\i=l k=j+l ) 

We present two equivalent definitions of stochastic dominance, both of which 
depend on a partial order < on the compact space ^4 for some set S. We will 
always assume < to be closed, i.e. {(x,y) : x < y} C (^4 S ) 2 is closed. The 
equivalence of these definitions is originally due to a result of Strassen (Theorem 11 
in [St], where in his language, S = T and e = 0); also sec Ligg for a shorter proof 
of this equivalence (Theorem 2.4 in [Ligg]) and a general introduction to interacting 
particle systems. 

Definition 17. For any set S , any partial ordering < on A s , and any measures \i 
and v on u < v ([i is stochastically dominated by v with respect to <) if there 
exists a coupling A of \x and v for which \({(x,y) £ (A s ) 2 : x < y}) = 1. 

Definition 18. For any set S, any partial ordering < on A s , and any measures 
H and v on A s , H < v (u is stochastically dominated by v with respect to <) if 
for any increasing bounded continuous function f from A s to R (f is increasing if 
f{x)<f{y) ifx<y), E^(f)<E u {g). 

We will repeatedly make use of three important properties of stochastic domi- 
nance. 

Lemma 2. For a partial ordering < on A s , define a relation <t on A T by restrict- 
ing < to T . (i.e. x <t y if there exist x' , y' G A s such that x'\t — x, y'\x — y, and 
x' < y' ) If <t "is a partial order, then for any measures fi < v on A s , iAt <t v\t- 

Proof. This is an obvious consequence of the first definition of stochastic dominance; 
simply marginalize the coupling A from the first definition of stochastic dominance 
to get a coupling At of /i\t and v\t with support contained in {(x,y) G (A T ) 2 : 
x <t y}- 
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Lemma 3. Stochastic dominance is preserved under weak limits; i.e. if fj, n — > /i, 
v n — > v weakly, and fi„ < v n for all n, then /i < v. 

Proof. This is an obvious consequence of the second definition of stochastic domi- 
nance. 

■ 

Lemma 4. If a sequence of measures {/J, n } on A s is stochastically monotone (i.e. 
either fj, n < /i„+i for all n or /i„+i < fJ, n for all n), then {/i„} approaches a weak 
limit [i. 

Proof. We assume that {fJ, n } is a stochastically increasing sequence, since the proof 
is nearly identical for the decreasing case. Since A s is compact, there exists a 
subsequence of /i„ which approaches a weak limit. Consider any two subsequences 
of {/J, n } which each approach weak limits, say fi nk — > fJ- and Li mk — > Then, by 
passing to subsequences again if necessary, we can assume that m < mi < 112 < 
to 2 < ■ ■ ■■ Since \i n is stochastically increasing, fi ni < fi mi for all i and fj, mi < fJ-n i+1 
for all i. 

By Lemma [21 this means that // < fx and Li < fjf, so fi = fjf. This means that 
all weakly convergent subsequences of approach the same limit, and so the 

sequence itself weakly converges. 

■ 

We now define a partial order which is particularly relevant to T-L. We think of 
1? as being colored like a checkerboard; (x, y) G 1? is colored black if x + y is even 
and white if x + y is odd. We define a site-dependent ordering of {0, 1}; for any 
t) £ Z 2 , ^„ is defined as < v 1 if v is black, and 1 < v if v is white. We use this 
site-dependent ordering to define a partial ordering on {0, 1} S for any S C Z 2 : for 
any x, x 1 £ {0, 1} S , x ^ x' if x(v) ^< v x'{v) for all v S S. This in turn defines the 
stochastic dominance partial ordering on measures on A s with respect to which 
we also denote by ^. 

For any rectangle R and S e Lqr(H), wc define a probability measure fi 5 on 
{0, 1} R which assigns equal probability to all configurations x such that the config- 
uration y £ {0, l} RudR defined by v\r = x and y\dR = S is admissible. (Note that 
by Proposition [21 ^ A is just the conditional probability distribution on R, given S, 
w.r.t. the measure of maximal entropy fi for %.) We define a special class of exam- 
ples: for any u,d,£,r £ {0,+,—} and any rectangle R, define S^ d ' ' r £ Lq^T-L) as 
follows: the symbols u,d,£,r determine boundary conditions adjacent to the top, 
bottom, left, and right edges of R. A + means that the sites adjacent to that edge 
of R are maximal with respect to ^, i.e. on white squares and 1 on black squares. 
A — means that the sites adjacent to that edge of R are minimal with respect to ^, 
i.e. 1 on white squares and on black squares. A means that the sites adjacent 
to that edge of R are all 0. We then define /j,^ d ' £ ' r to be ^l s r ' ' . 

The following theorem states that for the partial order < : comparability between 
two admissible boundary configurations implies stochastic dominance comparability 
between their associated measures. The theorem is a corollary of Lemma 3.1 from 
vdBS , and the proof is similar to that of Holley's theorem ([Hoi ) for the Ising 
model. 

Theorem 6. For any rectangle R and S,rj £ Lqji(H) such that S ^ n, /i s ^ fi n . 
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We can use Theorem [6] to derive stochastic dominance relationships between 
some of the measures y^' 1 ^ for different-sized rectangles. 

Theorem 7. For any integers k < k < £ < £' and m! < m < n, define rectangles 
R = [k,£] x [m,n] and R' = [k',£'] x [m',n]. Then h and 

0,-,-,- J o, 

^ MiJ' \R- 

Proof. We prove only the first inequality, as the second is similar. Our proof 
mirrors the proof of Proposition 2.5 from |BuS) . Since R C i?', we may write 

H R ,' ' |_r = \i Rl \r as a weighted average of the measures , where n ranges 
over all admissible configurations in TL on dR whose top edge is labeled by Os. 
By Theorem |H1 each term in this weighted average is stochastically dominated by 
f°R +,+ ' + w ith respect to ^, and therefore h A t fl' +,+ ' + |fi- 

■ 

The proofs of the following two theorems are almost identical. 

Theorem 8. For any integers k' < k < £ < £' and m < n < n' , define rectangles 
S = [k,£] x [m,n] and S' = [k',l'] x [m,n']. Then h ^°' + ^ + \s and 

-.0,-,- , -,0,-,-i 

Theorem 9. For any integers k! < k < £ < £' and m < n, define rectangles 
T = [k,£] x [m,n] and V = [k',£'} x [m,n]. Then ^'°' + ' + t ^?< + ' + \ T . 

We will also make use of the d~ topology on probability measures on a full shift A z . 
There are many different definitions for the d metric (for a thorough introduction 
to the subject, see (Rj), but the one which we will find most useful is the following. 

Definition 19. For any stationary measures fi and // on A z , 

d((j,,fj,') = min / di(a;(0),2/(0)) d\{x,y), 

where C(p,,fj,') is the set of stationary couplings of fi and /i' and d\ is the 1-letter 
Hamming distance given by d\{a 1 a) = and d\(a, b) — 1 for a =/= b. 

The d metric is useful for our purposes because of the nice behavior of measure- 
theoretic entropy in the d topology. We first need a definition: 

Definition 20. A stationary measure \l on a Z subshift X is ergodic if for 

any shift-invariant measurable set A C X. i.e. a measurable set A for which 
/i(AA<t„j4) = for all n, [i(A) is or 1. 

The following is Theorem 7.9 from [R] . 

Theorem 10. For any finite alphabet A and ergodic stationary measures /i and v 
on A z , ifd(fi, v) = e, then \h(fj,) - h{v)\ < eln \A\ — elne - (1 - e) ln(f - e). 

4. Main body 

We now restrict our attention to the hard square shift % and will use our prelim- 
inaries to prove some results about measures of maximal entropy on the Z nearest 
neighbor shifts of finite type H n . By Theorems [7] and [H for any fixed m < n and any 

fixed K, the sequences (m"^'^"]) \[-K,K]x[m,n] and (m^ JJ^J \[-K,K]x[m,n] 
are monotonically decreasing in the stochastic dominance ordering -< as k — > oo. 
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By Lemma IH this implies that for any K , these sequences approach weak limits, 

and so M°lt'fc]'^[m,n] and A t HM]'x[m,n] a PP roacn wea k limits, denoted by fi+ n n and 

+ 

/Lt^ n respectively. An almost identical proof (but with monotonically decreasing 
marginalizations) shows that (J-\Lk^]x[ m , n ] an d M[IV,jfe]'x[m,n] a ^ so approach weak lim- 

- 

its as h — y oo, which we denote by fJ,^ n and /U^ „ respectively. Finally, by using 
Theorem [9] instead of Theorems [7] and [8l we see that /V ^ r m „i approaches a 



weak limit as k ~ > oo, which we denote by n° mn . (Note: Technically, to discuss 
weak limits, we need all measures to live on the same space; to deal with this, 
we could extend each measure to {0, l} Zx ''"'"1 by simply appending 0s to every 
configuration in the support.) 

+ 

Lemma 5. For any integer n, fi^ n ^ fi° l n ^ ^ n and ^ n ■< n\ >n ■< fi^ n . 

Proof. We prove the first set of inequalities only, as the second is similar. For any 

fixed k, ^Lk'^Mun] =< ^]x[i, n] =< M^'xtM b ^ Theorem© By letting k -»• oo 

o 

and using Lemma [31 A*i n ^ Mi n ^ Mi n- 

■ 

-0 

Lemma 6. For any integer n, [i\ n < Mo,„|zx[i,n] d Mi,n anrf Mi, n ^ M°,n+i|zx[i,n] d 
+ 

Ml,n- 

Proof. We again prove the first set of inequalities only, as the second is similar. 
For any fixed k, t\l^~ {1 , n] < ^[-k^]x%n] |[-fc,fc]x[i,n] ^ M[-^']x[0,n] ![-*,*] x[i,n] ^ 
A*Hk'J]'5o,fi]l[-*.*]x[i,n] ^ /'KSxVl by Theorems [H and H By letting fc -> oo 

o 

and using LemmaH ^ n < /^,Jzx[i,n] < Mo,Jzx[i,n] ^ Mo,nlzx[i,n] ^ /"i, n) and by 
removing the second and fourth expressions we are done. 

■ 



Theorem 11. For any n, /i^ n is the unique measure of maximal entropy on H n , 
and is ergodic. 

Proof. By Proposition [21 for any measure \i of maximal entropy on a Z nearest 
neighbor SFT Y and for any a,b letters in the alphabet of Y, fj,(x\[ m+ i tn _xi = 
a | x(m) — a, x(n) = b) is uniform over all admissible configurations a given a 
and b. We claim that this implies that any measure of maximal entropy fi on 
the Z nearest neighbor SFT H n , when considered as a measure on {0, l} Zx [ 1,n l, 
is a uniform hard-core Gibbs measure on Z x [l,n]. To see this, consider any 
finite configurations w,w' £ {0, i} 5 ' u, 9(S\zx[i,n]) f or some finite S C Z x [l,n] such 
that u)\gfs,Ex[x,n]) — w '\d(S,zx[i,n})- Then, choose any interval [l,r] so that SU 
d{S,Zx [l,n]) C [l,r] x [l,n], and any configurations L e {0, and R g 
{0, l}{'+i}x[i.»] so that fji([L] n [i?]) > 0. Then, by Proposition EJ all configurations 
in .r+i]x[i,n](^) which have L on the left edge and R on the right have the 
same //-measure, and so fJ,([w] fl [L] n [i?]) depends only the proportion of such 
configurations which have restriction w on SL)d(S, Z x [1, n]). However, since H is a 
nearest neighbor SFT, this proportion depends only on the letters on d(S,Zx [1, n}), 
and so n([w] n [L] fl [i?,]) = /x([?z/] n [L] fl [R]). By summing over all such L, R, we see 
that /i([tu]) = n([w']), and so \i is a uniform hard-core Gibbs measure on Z x [l,n]. 
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o 

By its definition as a weak limit, it is not hard to check that [i\ n is also a uniform 
hard-core Gibbs measure on Z x [l,n], and by Theorem [5J there is only one such 



measure. Therefore, n\ n is the unique measure of maximal entropy on H n . It is a 
standard fact (Pa) that when a Z SFT has a unique measure of maximal entropy, 
it is ergodic. 

■ 



We note that by the definition of ergodicity, any marginalization [i\ n \\j eA B, a 
is also ergodic for A C [l,n]; a shift-invariant set with nontrivial measure for 

_ _ 

Hi n \\j eA R a would yield a shift-invariant set with nontrivial measure for fj,° n . 
Theorem 12. For any k, n, any even i £ [1, n], and any even j £ [— k, k], 



^ ^ fe ; fe ]' x [i, n] = 0) - ^k^ lhn] (x(j, i) = 0) 

< 2P . 5 ((j,i) O d(([-k,k] x [l,n]),Z x (-co,n])) and 



^ /*[Jfc , ,fc]'x[l,n]( a! 0'.*) = °) - /*hfc;fc]x[l > n]( a! 0'.*) = °) 

<2P . 5 {{j,i)^d(([-k,k} x [l,n]),Zx [l,oo))). 

T/ie order of the terms in the central differences are reversed when the parity of i 
or j changes. 

Proof. We prove only the first set of inequalities, as the second is completely 
analogous. For ease of notation, we write pb = f^-k ^xli nl an< ^ ^ = ^[-^'fc]'x"[i «]• 
Since fi ^ /i' by Theorem [51 and since the function X{i(j,i)=o} is a decreasing 
bounded continuous function on TL with respect to ^, the inequality < n(x(j, i) — 
0) — fi'(x(j,i) — 0) is clear by the second definition of stochastic dominance. 

The second inequality (J,(x(j,i) = 0) - n'{x(j,i) = 0) < 2P . 5 , Zx (-oo,n\ ((j. «) <-> 
k,k] x [l,n]),Z x (— cx),n])) will be proved in two steps. We first note that 
Proposition 3.3 from [vdBS] (where in their notation A n = [—k,k] x [l,n] and the 
underlying graph G is the subgraph Z x (— oo, n] of the square lattice) implies that 

(1) n(x(j,i)=0)-f,'(x(j,i) = 0) 

= (/xX//)(3path of disagreement from (j,i) to (?(([— k, k] x [1, n]), Zx (— oo, n]j) , 

where a path of disagreement for a pair £ ({0, l}[ _fc > fc ] x [ 1 '"]) 2 is simply a path 

of vertices P for which x(p) ^ y(p) for all p E P. 
It now suffices to prove that 

(/i x //)(3path of disagreement from (j,i) to 
a(([-fe,fc]x[l,n]),Zx(-oo,n])) < 2P . 5 ((j,i) <-> S(([-fc,fc]x[l,n]),Zx(-oo,n])). 

Our proof is just a version of the argument used to prove Corollary 2.2 from 
vdBS , adapted to the finite graph [~k,k] x [l,n]. We point out first that by 
the definitions of /i and //, they are MRFs on [— k, k] x [l,n]. The fundamental 
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observation we make is that for any (j', i') G [—k, k] X [1, n] and any configurations 
rj,r]' G {0, l} 8 ({W'»»')}.[-M]x[i,n]) j 

(/xx fi')(x(j',i') ^ : ar|a({(j',i')},[-fe,fe]x[i,n]) =»/) 

fls({(j , .» / )}.[-fc.*]x[i,n]) = ?/) < 0.5. 
This is easy to check; from the definitions of /i and //, the conditional distributions 

fi(x(J',i') | a:|e({C7''>»')}>[-*.*]x[i.»]) = ^ and m'WjO 1 2/|e({C/'.«')},[-fc.*]x[i.»]) = rf) 
are always either uniformly distributed between the letters and 1, or entirely 
concentrated on the letter 0. 

Then we note that since fj, and // are MRFs, for any (j', i') G ([—k, k] x [1, n]) \ 
and admissible 5,8' G {0, i}([-M]x[i,™])\{(jV)} 5 

(2) (p x //)((x, y) has a path of disagreement 

from to (/,i') | ar|([-fc,fc]x[i,n])\{{j',i')} = <*> y|([-fe,fc]x[i,n])\{(j',i')} = ^ 
(fiXfx')(x(j,i) ^ y(j,i) | ^|([-fe,fe]x[l,n])\{(j',i')} = ^2/l([-fe,fe]x[l,n])\{(j',i')} = - °- 5 
Therefore, the probability measure on {0, l}([ _fc ' fc l x [ 1 '™l)\'t(ii 1 )} which marks paths 
of disagreements to (j, i) w.r.t. (i x \x' by Is is stochastically dominated by the 
Bernoulli measure Po.5,([-k,k]x[i,n])\{(j,i)} with respect to the standard ordering 
< on {0,1}, i.e. < 1. More rigorously, if we define a factor map r from 
({ 0il }([-fc,fc]x[i,«])\{(^)}) 2 tQ { 0)1 }([-fc,fc]x[i,n])\{OM)} by ( T (x,y))(v) = 1 iff (x,y) 
has a path of disagreement from v to (j, i), then r(/i X //) < Po.5,([-k,k]x[i,n])\{(j,i)}- 
(This is proved by constructing a coupling of t(jx x //) and Po.5.([-k,k]x[i.n])\{(j,i)} 
where the Bernoulli trials for P0.5 always dominate, which is straightforward by 
©.) Then, 



(// x //) (3path of disagreement from to k, k] x [l,n]),Z x (— oo,n])) 

< (/x X /x')(3path II, not containing (j, i), from a neighbor of (j, i) to 
o*(([— k, k] x [1, n]), Z x (-co, n]) such that for each p G n, there is a path of 
disagreement from p to (j, i)J 



< P 



m (3path of Is from a neighbor of (j, i) to 



0.5,([-fc,fc]x[l,n])\{(j 

5(([-fc,fc]x[l,n]),Zx(-oo,n])) = 2P . 5 ((j ) i) <-> 0(([-fc,fc]x[l,n]),Zx(-oo,n])). 
Combining this with (|TJ) completes the proof. 

I 

Corollary 1. Por any n, any i G [l,n], and any j, 



Mi,n( x 0"i = °)-Mr,„(^(j, = o) 



< 2P . 5 ((j,i) O d((Zx[l,n]),Zx(-ao,n])) and 
<2P . 5 ((j,i)o9((Zx [l,n]),Zx [l,oo))' 



*J,„(xO',i)=0)-A*i,n( a? C7.i)=0) 
Proof. We again prove only the first inequality, as the proof of the second is similar. 



Let k — > oo in Theorem 1121 and use the definitions of fj^ n and [i\ n as weak limits. 
Then note that it is obvious that Pq.5,s(w <-> T) < Po.s(w O T) for any 5 C Z 2 , 
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s eZ 2 , and T C S, since enlarging the universal set S to Z 2 only allows for more 
possible paths of Is from v to T. 

m 



Our next result regards closeness of the measures fj,° n and fi° n+1 in the d metric 
when restricted to horizontal strips which are two rows high. We will consider such 
restrictions as measures on the full shift ({0, lj-Wxf ' 1 }) 2 f or purposes of the 
d metric. 

Corollary 2. For any n and any integer i G [l,n— 1], 

*(A»i,T.kuji 4+1 ,A*i,n+xkuji 4+1 ) <4fl,. B ((0 ) i)«3((Zx[l I n]) 1 Zx[l,oo))) 
+ 4P .5 ((0, i + 1) o 9((Z x [l,n]),Zx [l,oo))) and 
<?(Mi,„|fl < uii <+1 ,M?, n+ iU i+1 uii i+2 ') < 4P . B ((0,i) <-> 9((Z x [l,n]),Zx (-oo,n])) 

+ 4P . 5 (((M + l) o 9((Z x [l,n]),Zx (-co, n])). 

Proof. We begin with the first inequality. The proof is fairly similar to that 

of Lemma 3 from [KamKO , but we cannot apply this directly due to the site- 

- o + 

dependence of the ordering By Lemmas [5] and |H fi° n < fi° n < Mi n an d 

o + 

A*i,n ^ Mi, n+i Izx [l.n] ^ /"in- By Lemma [21 the same inequalities hold when all 
four measures are restricted to Ri U Pj+i. Then, by using the first definition of 
stochastic dominance, the following four couplings exist: 



A coupling Ai of /x° n \R t uR i+1 and pb\ ra |_R 4 ufl i+ i supported on {(w,x) : w ^ x} 

o + 

A coupling A 2 of ^5,„|fliUfl i+ i and ^J^ui^+i supported on {(x, z) : x < z} 

- 

A coupling A 3 of ^ n \ RtURi+1 and Mi, n +l |fliUi? I+1 supported on {(iu,y) : to ^ y} 

+ 

A coupling A 4 of n° tn+1 \R i \jR i+1 and Mi,J.RiU.R i+ i supported on {(y, z) : y<z\ 

By taking the relatively independent coupling of Ai and A2 over the common 

- 

marginal fJ,° ltn \R i uR i+1 , one arrives at a coupling A 5 of Mi,J.R«uR i+ i, Mi,Jfl«u.R <+ i, 
+ 

and n° supported on {(w,x, z) : it) ^ x ^ z}. 

By taking the relatively independent coupling of A3 and A4 over the common mar- 

- 

ginal Mi.n+ll-RiURi+i, one arrives at a coupling A 6 of Mi.nlfi.ufli+i, Mi,n+lUtUii* + i, 
+ 

and A*° n |fl i ufl i+1 supported on {(w,y,z) : it) ^ y ^ z}. 

Finally, by taking the relatively independent coupling of A5 and Xq over the com- 

+ 

mon marginal /ii,n|fl«ufl i+ i XMi.nl-RiU.flw+1, one arrives at a coupling A of Mi,nlfliUii i+ i) 

00 + 
Mi,JfliUfl i+ i, Mi,n+ilfliUiJ <+1 , and Mi,Jfl«uii <+ i supported on 

{(w,x,y,z) : w di x < z,w ^ y ^ z} C ({0,1} R * UR * +I f . 

+ 

The measures /ij n and /i° „ are not crnm -invariant; in fact their definitions as weak 

+ 

limits imply that 07i,o)Mi n = Mi «• They are, however, CT^ )-invariant, and so we 
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will consider them as measures on ({0, l}* ' 1 } ) z so that we may treat them as 
stationary measures. By replacing A by any weak limit of a subsequence of the 
sequence — 53i^o C(2i,o)^-> we may also assume that A is ct( 2 . o)-invariant. We for 
now assume that i is even, and claim that 

J di(tfl(0),z(0)) d\(w,z) < (t4 }n ( X (Q,i)=0)-vi n (x(0,i) = 0)) 

+ (f4 <n (x(0, i + 1) = 0) - A*?, n (x(0, i + 1) = 0)) 

+ (/xf in (a:(l 1 <)=0)-Aif in ( a: (l,*)=0)) 

+ (nl n (x(l, i + 1) = 0) - f il n (x(l,i+ 1) = 0)), 

where w and z represent sequences on the alphabet {0, l}^ ' 1 } , but x represents a 
configuration on the alphabet {0, 1}. In fact this is fairly straightforward; we may 
assume in the integral that w r< z. This means that iu(0) ^ 2(0) only when at least 
one of the inequalities 

M0))(0,0) < (z(0))(0,0), («7(0))(0,1) > (2(0))(0,1), 

M0))(1,0) > (z(0))(l,0), M0))(1,1) < (*(0))(1,1) 

holds. However, since w and z are configurations on Ri U R%+\, it should be clear 



that 
















\({(w. 


.*) 


: M0))(0,0)<(z(0))(0,0)}) 




= 0)- 




„(s(o,0 = 


0)), 


\({(w. 


.*) 


: ( W (0))(0,l)>(z(0))(0,l)}) 


= (jt{ n (x(p,i- 


M) = 


0)- 


-mL(^(o, 


i + l) = 


\({(w. 


.*) 


: M0))(l,0)>(z(0))(l,0)}) 


= ((4,n(x(l,i) 


= 0)- 




n WM) = 


0)), and 


\({(w. 




: M0))(l,l)<(z(0))(l,l)}) 




fl) = 


o)- 


+ 


i + l) = 


Since , 




+ 

= <7 (i,o)A*i m the right-hand sides of the first and third 


inequalities each 



equal (/i° (x(0, i) = 0) — /i" . n (x(0, i) = 0)), and the right-hand sides of the second 

+ 

and fourth inequalities each equal hu° n (x{0,i + 1) = 0) — (j.° n (x(0, i + 1) = 0)). 
Then, since A is supported on 4-tuples (id, x, y, z) for which w ^ x < z and w ■< y ^ 
z, and since clearly for such 4-tuples w(0) — z(0) =>• w(0) = x(0) = y(0) = 2(0), 

/ \ 

^( Ml,n|fl«UiJ j+ i) Ml^+llfliUiJi+i J < 

/" di(s(0),»(0)) dA(aj,y) < / di(tu(0), «(())) d\(w,z) < 

2( M ° n (z(0,i) = 0)-^ n (0,i) = 0))+2(/x?, n (x(0,i+l) = 0)-nl n (x(0,i+l) = 0)), 
which by Corollary [T] is bounded from above by 

4P . 5 ((0,i) O a((Zx[l,n]),Zx[l,oo)))+4P .B ((0,i+l) O 0((Zx[l,n]),Zx[l,oo))). 

+ 

Due to the earlier rescaling forced by the non-shift invariance of nl n and n° n , this 



is in fact a bound on the d distance between (i° Tl \R i uR i+1 and Mi in +i|ii i u.R j+ i as 
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measures on ({0, l}* ' 1 ! 2 ) 2 rather than ({0, l}i°} x i°- 1 }) z , but clearly the <f distance 
in the latter case is even smaller. 

O 

To prove the second inequality, simply use /ig n instead of /i" n+1 , and note that 



M0,nl-RiUfli + i = Ml,ri+llfl l+ iUfl l+2 - 

The proofs when i are odd are almost identical, except that the orders of all 
differences above need to be switched, which does not affect the final inequality. 

■ 

Since 0.5 < p c on the square lattice, the following is clear from Theorem [2] and 
Corollary H 

Theorem 13. There exist A, B > so that for any n and i G [l,n — 1], 
tf(/4,»kuK i+1 ,Mi,„+ikufl i+1 ) <Ae- B(n - t} and 

( ° \ -Bi 

^[f J 'l,n\RiURi +1 ,tJ>°,n+l\R i +iUR i+ 2j < Ae % - 

We note that clearly Theorem [13] also implies that 

tf(/4,»k,Mln+ik) <Ae- B ^ aodd(ji tn \ Jtt , l i, n+1 \R t+1 ) <Ae~ Bi ; 

either inequality can be proved by considering a restriction of the coupling A that 
achieves the analogous d~ distance in Theorem [13] and noting that restricting from 
a strip two rows high to a single row cannot introduce new disagreements. 

Now, we can prove Theorem [TJ by using measure-theoretic conditional entropies. 
We first need some notation and a preliminary theorem. For any H n , any stationary 
measure /i on H n , and any adjacent intervals /, JC [1, n], we partition the alphabet 
A n = £{o}x[i,n](%) of H n by the letters on / U J, and call this partition £/uj- We 
also partition A n by the letters on /, and call this partition £/. Then we make the 
notations 

^(U^ 1 ) := M<^iG")) and 
iei 

^(IM I U^) : =M^wG«) I 6). 

je.J iei 

(For the sake of completeness, we note that for any /, (/z) is essentially just 
fi\\j Ri ; we use the partition notation to more easily apply Proposition [TJ) We 

note that h^ ^ lj ig/ Rij can also be thought of as ft.(/i|y ej . ^J. We also note that 
by Proposition [JJ for any / and J, 

k{ U r *) =^(U^) +^(U^ i U^)- 

keiuJ iei jeJ iei 

For uniform hard-core Gibbs measures on Z x [l,n], we will prove an important 
fact about these conditional measure-theoretic entropies, which can be thought of 
as a two-dimensional entropic analogue of the fact that the future and past of a 
one-dimensional Markov chain are conditionally independent given the present. 

Theorem 14. For any n and any adjacent intervals /, JC [l,n], 

h s (IM ■ i U^) = h % (U^ i 

jeJ iei Ml >™ jeJ 
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where i € I is the element of I adjacent to J. 

Proof. We will only prove the theorem for the case where J is above /, i.e. I — [i , i] 
and J = [i+ 1, j], as the other case is trivially similar. Also, for this proof, given a 
finite set of configurations en 6 A Si , 1 < i < k, for which the shapes Si are pairwise 
disjoint, we denote by ot\cxi . . .oik the concatenation of the a*, i.e. the configuration 
on [L =1 for which [oL^a^ ■ ■ ■ ckfc)|si = ol% for 1 < i < fc. 



For readability, we abbreviate /j° n by /j in this proof. By definition, since the 
support of fj, is contained in H n , 

je./ ie/ «>ei[-*,*i X 7(«), u J L J; 

= lim — / In ( —, \ du(w.x). 

k^oo2k + lJ Hn \fi([w]n[x])J PV ' 

We make the decomposition 

(3) fi([w]n[x})= *]nNn[i]n[4 

I6i{-H)x[l,»](«), 

fl£i{Hi}*[i,»](H) 

We recall from its definition as a weak limit that fi is a uniform hard-core Gibbs 
measure on Z x [1, n], and so for any such L and R, 



\{u e i[_fe, fe ] x ([i, n ]) (W) : 



Figure 1. iu, x, L, and i? 



In Figure [U {m e £[-fc,fc]x([i,»'-i]utf+i,n])CW) : LuwxR € is the set of 

configurations which can legally fill the shaded area. 
We may similarly decompose //([w]): 



(4) 



E 



tx([w] n [i] n [R]). 



LeL { _ fc _ 1}x[1> „]CH), 

-Rei{fc+i} X [i,„]CH) 



Since /i is a uniform hard-core Gibbs measure on Z x [1, n], for any such L and i? 

|{it S i[-fc,fe]x([i,i'-i]u[»+i,n])(^) : iuwi?eL(X)}| 



M ([Hn[L]n[i?]) = M ([£]rW 



|{U6 L[_fc, fc ]x([l,n])(^) : iui? e 
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By and (0, for any L and R such that fi([L] n [i?]) > 0, 
(5) 

m(H n [L] n [i?]) _ |{"ei[- t ,t] X ([i,i'-i]u[( + i,»])(^) : iwwi?ei(X)}| 



n [at] n [L] n [R]) \{u ei[_fc,fe]x([i,i'-i]uy+i,n])(^) = LuwxR g L(x)}\ 

Since % is a nearest neighbor SFT, 

|{u € i[-fe,fe] x ([i,i'-l]u[i+i,n])(^) : iuwi? e = 

• (\{u" € L[_fe,fe] x [i+i,n](^) : Lu'wR e i(X)}|) 

and 

|{u € £[-fc,fc] X ([i,i'-i]u[j+i,n])(^) : LuwxR G L(X)}\ = 

(|{u'6L hfe , k | x|1 , H (?i) : £uVRgL(X)}|) 
• e i[_fc,fc]xb-+l 1 n](W) : 

Therefore, ([3]) implies 

(~1 [L] n [i?]) |{«Gi[-jk,jk]x[i+i,n](W) : LuwRe L(X)}\ 



(6) 



m(H n [x] n [£] n [R]) |{«ei Kk]xb - +M (W) : L^i? e l(X)}| 



fe+2 




-it-i 



Figure 2. 



In Figure [5J we see that since is a safe symbol of H, any configuration u G 
L\— k+i,k— i]x[i+i,n]('^) f° r which uw G L(%) may be extended in at least one way 
to a configuration v! G Lr_ fc,fc]x[i+i,ra]C^) f° r which Lu'wR G L(T-L) (by placing 
columns of 0s to the left and right), and any configuration u G £[-fc+i,fc-i] x [j+i,n] (H) 
for which ux G L{R) may be extended in at least one way to a configuration 
v! G £[-fc.fc]x[j+i.n](%) f° r which Lu'xR G L(%) (by placing columns of 0s to the 
left and right.) Also, clearly there are at most 2 2 ( n_l ) possible such extensions in 
the first case and at most 2 2 ^ n ~^ possible such extensions in the second. Therefore, 

|{ueL[. k+ i ik _i] X | J ' + i in ](?{) : ux e L(X)}\ 

< |{u G £[-fc,fc]x[j+i,n](%) : LuxReL(X)}\ 

<2 2 ^\{ueL hk+hk _ 1]xU+lM {n) : uxeL(X)}\ and 
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|{u € -L[_fe + i,fe_i] X [i + i, n ](H) : uw E L(X)}\ 

< |{u S ■L[_fc i fcj X [i+i,n](?0 : LuwReL(X)}\ 

<2 2 ^\{ueL [ _ k+hk _ 1]x[t+hn] (H) : uweL(X)}\. 

Then by (©, for any choices oi L,R for which fi([L] n [i?]) > 0, 
M(Mn[L]n[fl]) 

M(MnMn[L]nW) [2 _ 2(n _ 2 2(„-j)] and SQ b p and & 

|{nGL[_ fc+ i, fc _i ]xU + i,„](H) : ua;eL(A)}| 

MM 

p(Hn[i]) r„_2(n-i) 2 2 (™-j)l 

fc + l,fc-l]x[i+l,n 
\{ueL [ _ k+1<k _ 1]> , [j+lt7l] (-H) : ux€L(X)}\ 

The original conditional entropy h^({Jj e J Rj | [J ieI Ri) is 

lim — / In ( — — . ^ dutw.x), which is equal to 

fe^oo2fc+l J Hn \/j,{[w] ("1 [x])J y ' 

lim 1 f ln f \{ u £ L [-fc+M-i]x [»+!,«] (?^) : uweL(X)}\ \ 
k^oc2k + lJ Hn \\{u £ L [ _ k+hk _ 1]x \ j+ltn] ('H) : ux e L(X)}\ J 

since the difference between the functions inside the integrals is bounded as k — > oo. 

We now note that this expression does not depend on the left endpoint i' of /, 
and so 

/v(U^ i U^) = ^(\J R i i 

jeJ iei jeJ 

m 

Proof of Theorem^ By Theorem [TT1 h n +i = h (u° jn+ ij and h n — h (/i° jn ^- By 
using Proposition [TJ we may decompose these entropies as follows: 

h n = h(ti{ n ) = h o (iZ LfJ ) 

Ml, re 

n / j—l 

+ E fe 8 K i U 

i=L9J+i Pl '" V i=LfJ 

L ^ J 1 / n n 

+ ^ /i o (flt | (J -Ri J and 

k=i ^ i=fc+i ' 



= MMl,n+l) = h « (Afj) 
Mi, n+1 

+ ft 8 (AfJ+i I ^L?j) 

Ml, n+1 

n+1 / J-l 



E \§ 1 U 

,=Lfj+2 ^"+ lV i=Lfj y 

LfJ-1 , n+1 

£ h o (iZfc | (J iZ, 



fc=l M i.«+i v i=fc+l 
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By Theorem HH these decompositions may be rewritten as 





fit 

£ ft o (i2j I Rj-i) 



j=L?J+i Ml 
LfJ-i 



E h g (-Rib I -Rfc+i) and 



fc=i 



ftn+1 = ft(Ml,„+l) - ft o (-^Lfj) 



n+1 



i=Lf J+2 
LfJ-i 

E ft § {Rk I -Rfe+i)- 



fc=i 



By taking the difference, we see that ft n +i — h n = 

(7) ft o (i? LfJ ) - ft o (i? LfJ ) 
n , 

(8) + ]T r 8 (^+! I R o) - h § I ^-i) 



j=Lf J+i 
LfJ-i 



Mi',„ +1 Mi, 



(9) 



+ E ^ s (-^ fc I Rk+i) - ft o (-R/c I ^fc+i) 



(10) +fto (iZ Lf J+1 I i? Lf j) 



Ml, n+l 





Theorem [TT1 implies that the measures n\ n+1 \r. n and /i? -J.R, „ . are ergodic. Then 
by Theorem [T3l and Theorem [lOl it is clear that ([7]) is exponentially small in n, i.e. 
there exist constants Q and R independent of n so that ([7]) < Qe~ Rn . We may 
rewrite any term in the sum (|8]) by Proposition [TJ 

ft o (-ftj+i I R j) - ft o I Rj-i) 

<„+i m?,„ 

= f ft o (i^+i U i?,) - ft o (Ji, U Rj^)) - f ft g - ft o . 

V l4,n+l /*?,» / \ Mi,„ + i Mi,„ / 

By Theorem [H 

^(Mi.n+ik-Mi.nlfii-i) <^e~ Bj and <f >n+1 |^ +lU B, , Mi,„ l^ufi^i ) < Ae' Bj . 

Since j > |_§J and all of the relevant measures are ergodic, ([5]) is exponentially 
small in n by Theorem [TUJ The proof that (O is also exponentially small in n is 
similar. 



APPROXIMATING THE HARD SQUARE ENTROPY CONSTANT 



21 



All that remains is to show that the leftover term (p~0|) approaches h at rate 
which is at least exponential in n. It suffices to show that (|I0j) approaches any 
limit at all with rate at least exponential in n; by Lemma [TJ h n+ \ — h n approaches 
h in the Cesaro limit, and /i n +i — h n differs from (If Op by an exponentially small 
amount. So, if (|10[) approaches a limit at all, it must be h. 

We note that for any n, \J 1 ^r\ is either equal to |_§ J or [f J + I- But by 
Theorem 1 13| in either event, 

Then /i o (i? L wj-i j 1 1 I iZp+i j) — /i o I i?L « j ) is exponentially small 

in n by ergodicity and Theorem 1101 implying that (If 0[) is exponentially Cauchy, 
therefore it approaches a limit with rate at least exponential in n, and we are done. 

■ 

One application of Theorem [T] is to the computability of the real number h. We 
first need to define our notion of computability. 

Definition 21. A real number a is computable in time f(n) if there exists a 
Turing machine which, on input n, outputs a pair (p n ,qn) of integers such that 
— a\ < —, and if this procedure takes less than f(n) operations for every n. We 
say that a is computable if it is computable in time f(n) for some function f(n). 

Informally speaking, a real number a is computable if it is possible to give a finite 
description of a which allows someone to reconstruct as many digits of the decimal 
expansion of a as desired. For instance, e is computable since we can describe it as 
the sum of the reciprocals of the factorials of nonnegative numbers. All algebraic 
numbers are computable, but there are many more computable numbers than al- 
gebraic (though still only countably many.) For an introduction to computability 
theory, see |Ko| . 

Theorem 15. h is computable in polynomial time. (There exists a polynomial p(n) 
for which h is computable in time p{n).) 

Proof. Recall from Section Q] that for any Z nearest neighbor SFT X, h top (X) is 
the logarithm of the Perron eigenvalue of an associated matrix called its transition 
matrix. Since we will need a few relevant properties of these matrices, we quickly 
define them for Z nearest neighbor SFTs. Given a Z nearest neighbor SFT, which 
we assume without loss of generality to have alphabet [1, |^4|], the transition matrix 
B is a square 0-f matrix with size \A\, where £>y is if the adjacency ij is not 
allowed and I if the adjacency ij is allowed. 

Define, for any n, B n to be the transition matrix for H n = H n (T-L). Then B n is a 
square matrix with size s n := LA^ x [! n i . Since the horizontal adjacency conditions 
for % are symmetric (ij is legal if and only if ji is legal), the same is true for H n , 
and so all B n are symmetric. For T-L, the algorithm from [Pij mentioned in SectionQ] 
for generating any B n takes exponential time in n. (Briefly, one constructs B n+ i 
from B n by arranging four copies of B n in a square, and then by replacing the right 
half of the upper-right copy of B n , the upper half of the lower- left copy of B n , and 
the entire lower-right copy of B n by Os. The number of operations taken to generate 
this matrix is of the same order as the number of operations it takes to write down 
the entries, of which there are exponentially many in n.) Also, B n is nonnegative 
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real and symmetric, therefore it has all real eigenvalues, which we denote by A n> i, 
A„,2, A„ !Sn , where A„,i > |A n ,2| > ... > |A n , s J. For any positive integer k, 
tr((B n ) k ) = Yli=i i> an< ^ so if we assume k to be even, then 

A*,! < tr((B n ) k ) < s n X k nA . 

Since s n < 2™, 

A„,i < [tr((B n ) k )]i < 2*A„,i. 
If we choose k = 8 n , then k > rt4", and so 

A„,i < [tr((S„) 8 ")]^ < 2 4_ ' l A„ 4 . 

Since B n is a 0-1 matrix, X n i is less than or equal to the size s n of £?„, which 
is in turn less than 2". Combining this with the fact that 2 4 < 1 + 4 _n yields 
A„,i - [tr((B„) 8 ")]s^| < 2~ n . Also, the calculation of [tr((B„) 8 " )] ^ takes expo- 
nentially many steps in n; one simply needs to start with B n and square 3n times, 
then add the diagonal entries and take the result to the g^- power. 

Therefore, by investing exponentially many steps in n, it is possible to achieve 
approximations h n+ \ and h n which are exponentially close to h n+ i and h n respec- 
tively, and then by Theorem [TJ h n+ \ — h n is exponentially close to h. 

In other words, there exist C, D, E, and F so that for every n, there is an 
approximation, computable in less than Ce Dn steps, which is within Ee~ Fn of h. 
But then for any integer m, E?e~ F ( n+1 ' < — < Ee~ Fn for some n, and so one 
can approximate h to within ^ in at most Ce £) (" +1 - ) steps. Since m > -ge F ™, the 
number of steps required for the approximation is at most Ce D (mE)T' , which is 
clearly a polynomial in m. 

m 

The fact that h is computable follows from a more general result in [HocMj , but 
to our knowledge, very little was known about the rate. Another consequence of 
[HocMj is that there exist 1? SFTs whose entropies are computable with arbitrarily 
poor time (along with entropies which are not computable at all!), so Theorem 1151 
at least implies that h is "nice" within the class of entropies of SFTs. Though not 
as good as a closed form, this is still satisfying; since % is the simplest possible 
nondegenerate Z 2 SFT, one would hope for its entropy to be a relatively simple 
number. 

5. A COUNTEREXAMPLE 

Interestingly, it is not true for all I? SFTs that h n+ i(X) — h n (X) converges to a 
limit. This was shown by an example in [Pi] . However, this example was somewhat 
degenerate in that it was periodic, and in particular not topologically mixing. 

Definition 22. A Z d subshift X is topologically mixing if for any finite rectan- 
gular prisms S, T C lj d , there exists Rs,t so that for any translations S' and T' of 
S and T respectively such that \\s' — i'Hoc, > Rs.t for all s' G S' and t' € T" , and 
for any globally admissible configurations u £ Ls'(X) and v G Lt>{X), there exists 
x G X such that x\s> = u and x\t> = v. 
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In other words, X is topologically mixing if it is possible to see any two globally 
admissible configurations at any desired locations within the same point of X, pro- 
vided that you allow enough distance between them. Topological mixing is a strong 
condition for Z SFTs, and is a sufficient hypothesis for many theorems. However, 
for Z d SFTs with d > 1, topological mixing is a somewhat weak property. For 
many theorems in Z d symbolic dynamics (see [D], |D2) . |JM) . |Ligh| , and |Ligh2| ), 
it is necessary to assume a uniform mixing property, i.e. one where the distance re- 
quired to see two globally admissible configurations simultaneously is independent 
of their size. There is a hierarchy of uniform mixing conditions in Z d , including 
block gluing, corner gluing, the uniform filling property, strong irreducibility, and 
square filling mixing. (See [B oPS] for definitions of and some exposition on the 
conditions in this hierarchy.) 

We can modify the example from [Pi] to see that the weakest uniform mixing 
condition, block gluing, is not enough to ensure convergence of h n+ \(X) — h n {X). 

Definition 23. A 1 d subshift X is block gluing if there exists R such that for any 
finite rectangular prisms S,T C 7L d satisfying \\s — tW^ > R for all s £ S and all 
t 6 T . and for any globally admissible configurations u € Ls{X) and v € Lt(X), 
there exists x € X such that x\s = u and x\t = v. We call the minimum such R 
the filling length of X. 

We will not define any other uniform mixing conditions except to say that the 
stronger conditions have the same spirit, but enlarge the class of configurations 
which are considered. For instance, strong irreducibility is defined by considering 
any pair of globally admissible configurations, whether their shapes are rectangular 
prisms or something more complicated. 

Theorem 16. There exists a block gluing Z 2 nearest neighbor SFT Y for which 
liniji^oo h n+ i(Y) — h n (Y) does not exist. 

We begin by defining Y, which is a slightly different version of the SFT X^g 
defined in [BoPS . The alphabet A of Y consists of the integers 0,1, ... ,k for any 
k > (8 ■ 48 2 ) 2 = 339738624, along with the symbols si, S2, S3, S4, S5, sq (illustrated 
in Figure [3|), which we call grid symbols. The legal adjacent pairs of grid symbols 
are those where the line segments which meet the edges "match up," and which do 
not yield parallel line segments at a unit distance which do not meet. For instance, 
S1S2 is forbidden since the horizontal line segment meeting the right edge of si does 
not match up with any horizontal line segment meeting the left edge of S2, and the 
pairs S2S6 and S3S4 are forbidden since each pair would contain a pair of vertical 
line segments at unit distance which do not meet. Adjacencies between integers 
are as follows: may only appear horizontally adjacent to 0, may not appear 
vertically adjacent to 0, and non-0 integers may not be vertically adjacent. The 
only integer allowed to appear above a grid symbol is 0, and there are no other 
restrictions on adjacencies between grid symbols and non-grid symbols. 



Figure 3. Grid symbols in the alphabet A 
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The net effect of all of this is that any point y € Y has grid symbols partitioning 
the plane into rectangles (possibly infinite), and on each rectangle y is labeled with 
integers, where the rows alternate between rows of all Os and rows of arbitrary 
strings of non-0 integers between 1 and k. In any such rectangle (finite or infinite) 
with a bottom row, this row must be labeled with Os. 

First, we will verify that Y is block gluing with filling length 9. Consider any two 
rectangular configurations w and w' which are globally admissible in Y. Without 
loss of generality, we assume that both w and w' have shape [l,n] 2 . For any 
v G Z 2 with ||u||oo > n + 9, we will construct x € A z for which a;|ri n i2 = w and 
a; l[i,n] 2 +u = w ' ' ■ First, place w and w' at the corresponding locations, as in Figure^ 
Clearly either the horizontal separation or vertical separation between w and w is 
greater than 9, and for now we assume that it is the vertical separation. 



5 6 6 



2 4 






8 


2 


4 7 9 3 




Figure 4. w and w' 

First, we will extend each of w and w' to a slightly larger square. We describe 
the procedure only for w, as the corresponding procedure for w' is completely 
analogous. We begin by placing grid symbols on the border of [— 3,n + 4] 2 , i.e. 
at a distance of 4 from w. The top and bottom edges are labeled with horizontal 
lines (the symbol si), the left and right edges are labeled with vertical lines (the 
symbol S2), the lower two corners are labeled with the symbol S3, and the upper- 
two corners are labeled with the symbol S5. Denote the square [— 3,n + 4] 2 by 
B. For each edge of w, look for any grid symbols which contain a line segment 
which hits the boundary of w, and extend such segments to the corresponding edge 
of B by using a string of grid symbols s\ or S2 (along with the proper "joining" 
symbol S3, S4, S5, or sq when this string hits the edge.) This partitions B into 
rectangles, which we would like to fill with integers. Any empty rectangles are easy 
to fill, and we can almost just complete the rectangles which already contain some 
integers from w in a locally admissible way, but there is one slight problem; when 
continuing the pattern of alternating rows of Os and rows of non-0 integers begun 
by a partially filled rectangle, we could end up with a non-0 integer above one of 
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the horizontal line grid symbols along the bottom edge of B, which is illegal. This 
is easily addressed though: before filling in any rectangles, consider any interval 
of integers on the bottom edge of w. If such an interval is made up of Os, place a 
horizontal line of grid symbols below it to end its rectangle. If an interval is made 
up of non-0 integers, place a row of Os below it, and then place a horizontal line 
of grid symbols below that. Again extend any incomplete paths to the boundary 
of B, and since each rectangle intersecting the bottom edge of B is now empty, 
it is possible to fill all rectangles with integers, without changing w, in a locally 
admissible way. The resulting configuration on B (and the corresponding one on 
B') is locally admissible. To fill the rest of Z 2 , we simply extend the segments of 
horizontal line grid symbols on the top and bottom edges of both B and B 1 infinitely 
to the left and right, and fill in the resulting empty infinite rectangles with integers 
in any locally admissible way. (This procedure is illustrated in Figure [5]) 

If instead the horizontal separation between w and w' was at least 9, then the 
only changes to the above construction would be to use grid symbols S4 and sq on 
the corners of B and B' , and to extend the segments of vertical line grid symbols 
on the left and right edges of B and B' infinitely upwards and downwards instead. 
Since w and w' must have been in one of these two situations, we have proved that 
Y is block gluing. 



5 6 6 2 4 




4 7 9 3 




7 6 11 




2 7 9 4 




Figure 5. Interpolating between w and w' 



We will now verify that linin^oo h n +\{Y) — h n (Y) does not exist. The basic idea 
is that most of the entropy is contributed only by the integer symbols in A, and 
that the entropy contributed by these symbols grows a lot when transitioning from 
a strip of height In to a strip of height 2n + 1, and not as much when transitioning 
from a strip of height 2n — 1 to a strip of height 2n. 



26 



RONNIE PAVLOV 



Fix any n,m G N. We will bound |i^4[i, m ]x[i,2n-i](^)| from above and below. 
The lower bound is easy: by considering configurations labeled by alternating rows 
of Os and non-0 integers with non-0 rows on the top and bottom, we quickly see that 
|-^[i,m]x[i,2n-i]Q0l > k mn . For the upper bound, we have to work a bit harder. 
Consider any configuration w G £-<4-[i,m]x[i,2n-i]Q0 which contains g grid symbols 
for some g G [1, (2n — l)m]. We first bound from above the number of ways that 
these grid symbols can be placed. 

The key point in our argument is that since, in points of Y, each grid symbol 
must be adjacent to at least two other grid symbols and there are no grid symbols 
consisting of only a corner, there are no locally admissible closed finite loops of 
grid symbols. Therefore, any grid symbol in the interior of a locally admissible 
configuration with shape [1, to] x [1, 2n — 1] is part of a path of adjacent grid symbols 
which hits the boundary of [l,m] x [l,2n — 1] at least twice (once entering, once 
leaving.) This enables us to design an algorithm which allows a Turing machine to 
recreate any locally admissible configuration of g grid symbols on [1, to] x [1, 2n — 1] 
given a specific piece of input consisting of a finite ordered list L of coordinates 
on the boundary of [1, to] x [1, 2n — 1] and a <?-tuple X of instructions taken from 
a set of 48 different commands. The list L consists of sites on the border of the 
rectangle [l,m] x [l,2n — 1]. Every instruction in X is itself a 3-tuple (di,fi,di) 
(1 < i < to), where a, G {si, S2, S3, S4, S5, sq} represents one of the six grid symbols 
in the alphabet of Y, fi G {0, 1} is a flag that signals either "revert" or "continue," 
and di G {up, down, right, left} is one of the four standard directions in Z 2 . Now the 
Turing machine processes its input and puts down grid symbols on [1, m] x [1, 2n — 1] 
as follows: the machine starts by moving its writing-head to the coordinate given 
by the first entry in the list L (if L is empty, the algorithm stops here.) There it 
puts down the symbol a\ from the first instruction in X starting a finite part of 
some path. If fx is "continue," it moves its writing-head one step in the direction 
given by d\, where it executes the next instruction in the same manner. If some fi 
is "revert," the machine moves back along the grid symbols written so far until it 
comes to the first junction (one of the symbols {S3, S4, s 5 , sq}) where one of the three 
branches is a dead-end (i.e. the branch points to a place still inside [1, to] x [1, 2n — 1] 
where the machine has not already placed another grid symbol.) From there, the 
machine moves one step in the direction specified by di and continues with the 
(i + l)th instruction. If there is no dead-end, the machine moves its writing-head 
to the next coordinate from the list L, where it starts another path of grid symbols 
using the next instruction from X. After executing all commands in I, the machine 
has placed exactly g non-blanks. 

We claim that every locally admissible configuration w consisting of g grid sym- 
bols can be created by our Turing machine using some input. If g = 0, clearly the 
empty input suffices. If g > 0, then there is some grid symbol on the border of 
[1, to] x [1, 2n — 1] , which we can take to be the first site in L. Then, follow any path 
of adjacent grid symbols in w, recording the proper entries of X, until you either run 
into the border of [1, m] x [1, 2n — 1], or will be forced to run into an already visited 
grid symbol. If you have visited all g grid symbols in w, then you are done. Since 
w does not contain closed finite loops, if there are still unvisited grid symbols in u>, 
then they are all either connected to an already visited grid symbol or connected 
to the border of [l,m] x [l,2n — 1] by a path of adjacent grid symbols. So, we 
can record an entry of X with fi "revert," and either move back to the first place 
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along your path where you could continue to unvisited grid symbols, or, if this is 
impossible, begin with an unvisited grid symbol on the border of [l,m] x [l,2n — 1], 
append this site to L, and continue. In this fashion, we can eventually visit all g 
grid symbols in w, simultaneously recording the input which will recreate w. 

Therefore, the number of different input "programs" gives an upper bound on 
the number of ways to place g grid symbols on [l,m] x [l,2n — 1] in a locally 
admissible way. By overestimating the number of lists L by the number of subsets 
of the boundary of [1, m] x [1, 2n - 1], we get an upper bound of 2 4 ™+ 2m - 6 48 9 . 

Now, fix any locally admissible assignment of g grid symbols. We wish to bound 
from above the number of ways to fill in the leftover rectangles with integers in 
a locally admissible way. For any w G £^4[i,m]x[i,2n-i](X); consider a column of 
w which has h grid symbols in it. This column consists of alternating intervals 
of integers and grid symbols. Due to the restriction that non-0 integers cannot 
appear above grid symbols, each one of these intervals of integers has at most half 
non-0 integers, except possibly for the bottom-most interval, which could have one 
more non-0 integer than 0. This means that the total number of non-0 integers in 
the column is at most n—\. Since the only choice for each interval of integers is 
whether its bottom-most integer is or non-0 and which non-0 integers to use, and 
since only the bottom-most interval admits a choice about whether its bottom-most 
integer is or non-0, this implies that the total number of ways of filling the leftover 
portion of this column with integers is at most 2-k n ~2 . Therefore, the total number 
of ways to extend any fixed locally admissible grid symbol configuration containing 
g grid symbols to a locally admissible configuration on all of [1, m] x [1, 2n — 1] is 
at most 2 m fc mn ~i, and so 

(2n-l)m 

|£"4[i,m]x[i,2n-i](*0l < 2 4n+2m - 6 48 9 2 m fc m "-* < ((2n-l)m+l)2 4n+3m - 6 fc" 

3=0 

(Here the last inequality uses the fact that k > 48 2 .) Combining with the earlier 
lower bound on \LA[ l m } x [ 12n -i]{Y)\, taking logarithms, dividing by m, and letting 
in — y oo yields the bounds nln k < h 2n -i(Y) < n In k + In 8. 

We will now achieve similar bounds on |_LAn TO i x r 1;27 j](5^)|. Again, we may arrive 
at a lower bound by considering only configurations of integers: |£^4[i, m ]x[i,2ri](^)l > 
k mn . By the same proof as before, the number of ways that g grid symbols can 
be placed on [l,m] x [l,2n] in a locally admissible way is less than 2 4 ™+ 2m ~ 4 48 5 . 
Also by the same proof, the number of ways to fill a column with h grid symbols 
in a locally admissible way is at most 2 • k n ~ . We note that if h = 0, then 
the number of ways to fill the column is clearly 2k n , and so our upper bound is 
min(2/c™,2 • k n ~). The number of ways to complete a fixed locally admissible 
grid symbol configuration containing g grid symbols to a locally admissible config- 
uration on all of [1, m] x [1, 2n] is then at most min(2 m fc m ™, 2 m k mn ~ s ^'), and we 
get the upper bound 

2nm 

\LA [hm]x[h2n] (Y)\ < 2 4 "+ 2m - 4 48 ff 2 m min(fc" m ,fc m "- 4 ^ 1 ) 

9=0 

2mn 

= 2 4n+3m - 4 489 min(/c" m , fc 1 """^) 
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< 2 4n+3m-4j^ 48 2m fc mn + 48 9 /c m ""* < 2 4n+3m - i (2mn + l)48 2m fc mn . 

g=0 g=2m+l 

(The last inequality uses the fact that k > 48 4 .) Combining with the earlier 
lower bound on |L J 4[ ljm ] x [ lj2 n](^)|, taking logarithms, dividing by to, and letting 
to — t* oo yields the bounds nlnfc < h2 n {Y) < nln/c + ln(8 • 48 2 ). But then for any 
n, h 2n+1 (Y) - h 2n (Y) > Ink - ln(8 • 48 2 ) and h 2n (Y) - h 2n ^(Y) < ln(8 • 48 2 ). 
Since k > (8 • 48 2 ) 2 , this means that there exists e > so that h 2n+ i (Y) — h 2n (Y) > 
h 2n {Y) — h 2n -i(Y) + e for all n, and so h n+ i(Y) — h n (Y) does not approach a limit 
as n — > oo. 



6. Questions 

There are several questions which suggest themselves from this work. Firstly, 
though we have shown that h n+ \ — h n — > h at a rate which is at least exponential, 
we have not been able to give any explicit bound for this rate. 

Question 1. Is it possible to give explicit values of A and B for which h n+ \ — h n < 
Ae~ Bn ? 

The answer to this question would be interesting both because it might allow us 
to improve the known bounds on h and also because it would allow us to give an 
explicit polynomial upper bound on the time of computability of h. In order to find 
such A and B, it would be sufficient to give an explicit such A and B for p = 0.5 in 
Theorem [3 but it seems that finding these is somewhat difficult. We note that for 
much smaller percolation probabilities than 0.5, giving explicit values for A and B 
is easy. For instance, if p < 0.25, then since there are less than 4* paths from to 
d([~n,n} 2 ) of length t for any t, P p (0 O d([~n,n} 2 )) < Et=„(#)* = T^^pT '■ 

Question 2. Is it possible to extend these methods to a larger class ofli 2 SFTs? 

The difficulty here is that our proof relies on two important properties of %. 
First, there must be some (possibly site-dependent) ordering on the alphabet for 
which the fundamental Theorem [S] is true, and this does not seem to be true for 
all shifts of finite type. Secondly, in order to use the methods of [vdBSj to prove 
exponential closeness of the relevant measures with respect to d, the SFT must 
satisfy a quite restrictive property related to conditional probability of disagreement 
at a pair of sites given their neighbors. (For most 1? SFTs, the 0.5 in Theorem [T2l 
becomes a number larger than p c , which means that we cannot show exponential 
decay.) So far, we have not been able to find any nondegenerate 1? SFTs besides 
the hard square shift which have both of these properties, but it is possible that 
with a slightly different method, one could consider a wider class of systems. 

Question 3. Is it possible to apply these methods to the Z d hard square shift for 
d>2? 

The difficulty here is that already p c (Z <1 ) < 0.5 for d = 3 QCamR]), which causes 
a problem with using Theorem rT2] to imply exponential decay of d distance. 
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